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STATEMENT OF FOCUS 



The Wisconsin Research and Development Center for Cognitive Learning 
focuses on contributing to a better understanding of cognitive learning by 
children and youth and to the improvement of related educational practices. 
The strategy for research and development is comprehensive. It includes 
basic research to generate new knowledge about the conditions and processes 
of learning and about the processes pf instruction, and the subsequent de- 
velopment of research-based instructional materials, many of which are de- 
signed for use by teachers and others for use by students. These materials 
are tested and refined in school settings. Throughout these operations be- 
havioral scientists, curriculum experts, academic scholars, and school people 
interact, insuring that the results of Center activities are based soundly 
on knowledge of subject matter and cognitive learning and that they are 
applied to the improvement of educational practice. 

This Technical Report is from the Basic Pre-Reading Skills: Identi- 

fication and Improvement Project in Program 1. General objectives of the 
Program are to generate new knowledge about concept learning and cognitive 
skills, to synthesize existing knowledge, and to develop educational ma- 
terials suggested by the prior activities. Contributing to these Program 
objectives, this projects basic goal is to determine the processes by 
which children aged four to seven learn to read and to identify the 
specific reasons why many children fail to acquire this ability. Later 
studies will be conducted to find experimental techniques and tests for 
optimizing the acquisition of skills needed for learning to read. 
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ABSTRACT 



Statement of the Problem 



The purpose of this study was to determine the pronunciations 
children give to synthetic words containing vowel cluster spellings, 
and to analyze the observed pronunciations in relation to common 
English words containing the same vowel clusters. The pronunciations 
associated with vowel cluster spellings are among the most unpredictable 
letter-sound correspondences in English. If learning to read includes 
learning to translate from spelling to sound, then vowel clusters should 
pose a particularly difficult problem for children. Determining the 
manner in which children solve this problem — i.e., the factors related 
to children* s pronunciations of vowel clusters in unfamiliar words — 
could shed more general light on this complex decoding act. 

The study dealt with the following independent variables: grade 

level (second, fourth, and sixth), sex, reading level (high and low), 
community type (suburban, urban, and rural), vowel cluster (a subset of 
nine — ai, au, a^, ea, jie, oja, oo, £u, and ow) 0 and response type 
(principal and secondary). The dependent variables were difference 
scores between the subjects* principal and secondary pronunciations of 
vowel clusters and the principal and secondary pronunciation proportions 
of vowel clusters on two corpora — a 1963 modification of the 20,000 
most common words on the Thorndike Frequency count (Type Corpus) 
and the most frequent 1,000 words on the 1967 Kucera and Francis 
computational analysis of present-day American English (Token Corpus). 

Procedures 



Two pilot studies were conducted to refine and modify the testing 
instrument, a 100 item multiple choice test. The instrument included 
90 synthetic words containing vowel clusters, (ten synthetic words for 
each of the nine selected vowel clusters) and ten check items. Four 
real word distractors contained the major pronunciations for the vowel 
cluster on the type and token corpora. 

The sample consisted of 436 elementary pupils from a suburban 
an urban, and a rural community, all in Wisconsin. Second, fourth 
and sixth grade boys and girls of both high and low reading levels 
were included. Each subject responded to two 50 item halves of the 
instrument on two consecutive days. 

To test twelve hypotheses and answer three questions two analyses 
were performed. In each analysis the design was a3x2x2x3x8 
(or 7) x 2 analysis of variance, in which the main effects were grade, 
sex, readir g level, community type, vowel cluster (eight on the type 
analysis and seven on the token analysis) and response type. 
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Results 



1. Grade level was significantly related to vowel cluster pronun- 
ciation. There was an upward progression from second to sixth 
grade in the proportion of principal vowel cluster pronunciations 
given in both analyses. 

2. There were no significant sex differences in either analysis. 

3. Subjects of high reading level consistently gave more principal 
pronunciations to vowel clusters in both analyses than the 
poorer readers. 

4. Suburban subjects tended to give the principal pronunciations 
of vowel clusters more consistently than urban and rural 
subj ects . 

5. Subjects’ pronunciations were more closely related to word 
types than to word tokens, particularly to the principal pro- 
nunciations in the type corpus. 

6. Word configuration seemed related to vowel cluster pronunciations 
Conclusions 



1. As children progress through the grades, their vowel cluster 
pronunciations more closely parallel the correspondences 
occurring in common English words- 

2. Being a preference inventory, not a test of "correctness" , 
sex differences were not significant. 

3. Better readers are less deviant from correspondences in 
common words in their pronunciation of vowel clusters than 
are poorer readers. 

4. Suburban children tend to more closely approximate the vowel 
cluster correspondence frequencies in common words than urban 
and rural pupils. 

5. Principal pronunciations of vowel * clusters in word types relate 
more' closely to children’s pronunciations than do the cor- 
respondences in word tokens. 

6. Contextual environment and word postrion seem to influence 
vowel cluster pronunciations by children. 
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It has been said that the act of learning to read is perhaps the great- 
est intellectual feat of anyone f s lifetime, and teaching people to 
read has been the concern of educators since the development of the 
first alphabet. In spite of this long history of teaching reading, 
there is yet no universally accepted definition of "reading." 

The history of reading instruction in this country, perhaps more 
than any other educational endeavor, has been characterized by a variety 
of methodologies and by missionary zeal. More research has been done 
on reading than any other school subject. Reading materials appear, 
flourish, and fade with amazing rapidity. Phrases come and go. Ten 
years ago "decoding" was uttered only by the unenlightened-- today its 
popularity is immense. 

In the past decade the field of reading has felt the influence of 
linguistic science, and the impact has been profound. Linguists study 
language, and their discoveries and theories have offered insights to 
those who would help children learn to read* Linguists have isolated 
the features of language, and have described their functions and 
relationships. Studies have been done of phonology, morphology, and 
syntax, of pitch, juncture and stress, and of competence and 

performance and of countless other related areas. Some studies have 
dealt with the relationship between orthography and phonology. 
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Language is a system of oral codes through which humans communi- 

and audible 

cate, and writing is a graphic representation of these oral / symbols . 
Children learn to listen, speak, read and write — usually in that order, 
and most develop considerable aural/oral facility before any formal 
reading instruction begins. They have learned to discriminate and 
articulate most of the phonemes of their native language, and to com- 
prehend and generate meaningful sentences before they enter school. 

Of all the skills of language and thought, perhaps the only one 
unique to beginning reading is the ability to translate what is written 
to oral language already possessed. Symbols represent sounds; unfortu- 
nately, some symbols represent many sounds, and some sounds are 
represented by many symbols in English. Recent studies (Venezky, 1966; 
Hanna et al,, 1966) aided by computer technology have tabulated 
correspondences between spelling and sound and sound and spelling in 
common English words. However, little research has been done to deter- 
mine whether or not these relationships are actually used by competent 
readers — and if they are, how children acquire them. 

If initial reading includes the translation from spalling to sound, 
and if these symbol-sound correspondences are not one to one, research 
is needed regarding childrens pronunciation behavior. When a child 
encounters an unfamiliar written word, what factors influence his 
choice of pronunciation? The present study is concerned with this 
question. 
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Chapter I 

STATEMENT, BACKGROUND AND RATIONALE OF THE PROBLEM 



Statement of the Problem 

This study was designed to investigate elementary pupils* pronun- 
ciations of selected vowel clusters for which predictable letter-sound 
correspondences rarely exist, and to compare the observed pronunciations 
to pronunciation frequencies of vowel clusters in common English words. 

In more specific terms, the investigation dealt with the following 
questions : 

1. How well do children* s pronunciations of vowel clusters in 
synthetic words approximate the actual pronunciation frequencies 
of the same vowel clusters? 

2. What differences are there in the vowel cluster pronunciation 
frequencies of good and poor readers? 

3. Do boys and girls differ in their pronunciations of vowel 
clusters? 

4. What differences are there in the vowel cluster pronunciations 
of second, fourth, and sixth grade subjects? 

5. Do children of different community types differ in their 
pronunciations of vowel clusters? 

6. Will children's pronunciations of vowel clusters be more 
closely related to the letter-sound correspondences on a 
type corpus or a token corpus? 

7. Will word position or consonant environment affect the pro- 
nunciation of vowel clusters in synthetic words? 
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B ackgroun d 

Educators who are concerned with the improvement of reading 
instruction are rarely satisfied with the status quo. For years the 
field of reading has been characterized by the ubiquitous debate over 
beginning reading methodology. Proponents of such methods as language 
experience, x^hole-word , individualized instruction, synthetic phonics, 
linguistic, and modified alphabets have been prolific in their research, 
writing and speaking. 

The publication of Chall’s survey (1967) generated increased 
polemics, for her conclusions tended to dispel t, convent ional wisdom". 
Admitting that no single approach to beginning reading was all good or 
all bad, she nevertheless concluded that ". . . the first step in learn- 
ing to read one’s native language is essentially learning a printed 
code for the speech we possess" (p„ 83). A growing number of reading 
specialists (Clymer, 1968; Goodman, 1964; Burns, 1965; Betts, 1964; 

Lamb, 1968; and others) and linguists (Fries, 1963; Bloomfield, 1961; 

Hall, 1961; Venezlcy, 1966; Weber, 1968; Weir, 1964; and others) are 
generally in accord with this view. 

This study was not designed to compare methodologies in either 
the "meaning emphasis" or "decoding emphasis" philosophies, but was 
intended to examine in detail one aspect of the "code." American 
English uses more than 40 phonemes, depending on regional dialect, 
represented in a variety of ways by the 26 letters of the Roman alphabet. 
The net result is several hundred letter-sound correspondences (Bronstein, 
1960). Whatever method of reading instruction is used with the 
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beginning reader, the child must somehow develop the ability to 
translate the written form of English into its oral counterpart. 

He must sooner or later be taught--or discover for himself-- the code. 

Only a knowledge of the code--the relationship between some two 
dozen letters and 40 or more sounds-**wi 11 permit readers to increase 
their reading fluency and vocabulary. Without this understanding and 
its inherent transfer, each word would have to be memorized. In fact, 
studies show that good "whole-word" readers have discovered and use 
letter-sound correspondences (Bishop, 1964). 

Language has been dissected in various ways by linguists and 
educators. Bloomfield ian linguists refer to four levels of language: 
the phonemic, morphemic, syntactic, and semantic (Hockett, 1958). 

The transformational-generative linguists of the Chomsky school speak 
of competence and per f ormance-- the deep structure and surface structure, 
ahd the syntactic, lexical, semantic, and phonetic components (Chomsky, 
1957), while the importance of the suprasegmen tal phonemes of stress, 
intonation, and juncture, is advanced by Trager and Smith (1957). 
Regardless of one's notions of the structure and component parts of 
the language, the graphic representation of language, the orthography, 
is the barrier which must be crossed in learning to read. The ortho- 
graphy--and its relation to sound--is the most important feature of 
language for anyone learning to read. 

Mathews attributes the extraordinary greatness of the Greeks to 
the acceptance of this linguistic fact: 
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Although various peoples had been writing for thousands of years 
before the Greeks, the latter outstripped all those who had preceded 
them in this field. The secret of their phenomenal advance was in 
the vividness of their conception of the nature of a word. They 
reasoned that words were sounds, or combinations of ascertainable 
sounds, and they held inexorably to the basic proposition that writing, 
properly executed, was a guide to sound* Their firm rdherence to this 
view caused them to be dissatisfied with the failure if the Egyptians 
and the Semites to take full account of all the speed- sounds, the 
vocalic ones in writing being slighted. 

Other peoples, such as the Babylonians and the Egyptians, had 
caught glimpses of the desirability of having signs represent sound s , 
not things , but they were never able to break with convention to the 
extent of setting aside picture writing in favor of letter writing. 

The fundamental defect of picture writing was that it was not based 
upon sounds at all. The Greeks saw this basic weakness and by avoid- 
ing it achieved everlasting distinction (Mathews, 1967, p. 7). 

If one accepts the fact that language is oral and writing is a 
representation of speech, and further that the ability to read involves, 
either consciously or subconsciously, the translation from written 
symbols to sound, it must follow that accurate information about the 
symbol-sound relationships of English is needed. Until this decade 
little information of this nature had been accumulated scientifically. 
Spelling reformers had, perhaps, contributed the greatest quantity of 
literature on English orthography, but their arguments were based on 
the assumption that alphabets should be perfectly phonetic or phonemic, 
i.e., for each sound there should be a letter. Nevertheless, many 
interesting observations of the nature of speech came from reformers 

i 

such as Hart. 

Since the 16th Century, studies of symbol-sound relationship in 
English have been published. Abercrombie (1948) and Dobson (1959) 
survey most (if not all) of the earlier works. Perhaps the most com- 
plete analysis of spelling- to-sound correspondences during those early 
times was that of Douglas in 1740 (Ilolmberg, 1956). In fact, some of 
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the more recent descriptions of letter-sound relationships are inferior 
to the work of Douglas. 

Robert Hall published a monograph barely a decade ago, concerned 
with the relationship between letters and sounds (1961). His primary 
intent was to present solid linguistic information which might contri- 
bute to the demise of the then prevalent "lock-say” method of reading 
instruction. His work includes lists of English phonemes and their 
various graphemic representations. Hall's contention that many of the 
"irregularities" in English were intentionally devised some 500 years 
ago to keep reading and writing in the hands of the upper classes, 
runs contrary to language history. However, he feels the only way to 
teach reading effectively is to establish in the learner's mind a 
correlation between letters and sound (p. 60). 

In 1961 Venezky (1963) developed a computer program to derive and 
tabulate spel ling- to- sound correspondences in a corpus of 20,000 
common English words. The computer analysis provided: 

A complete tabulation of the spelling- to-sound correspondences 
in a corpus, based upon the position of consonant and vowel clusters 
within the printed words. For any continuous string of vowels or 
consonants found in a printed word, the tabulations include all of 
the pronunciations found for that string, along with the totals and 
percentages for each pronunciation in each word-position (initial, 
medial and final), and complete word lists for each correspondence 
found . . (Venezky, 1967), 

Weir (1964) advanced the hypothesis that if the writing system 
of English is viewed as a morphophonemic system, there is a much 
greater degree of regularity evident than if a letter- to-sound rela- 
tionship were assumed. That study and later work by Weir and Venezky 
(1965) lent considerable support to this hypothesis. One of the 

O 
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results of their analyses was the description of language-dependent 
units on the graphemic level, called functional units, which are 
significant for the prediction of sound from spelling. Functional 

units are divided into two classes — relational units and markers. 

i 

Relational units refer to a string of one or more i graphemes which 

i 

,j 

relate directly to a morphophonemic correspondent^ for example, _t — >/ 1/ 
and j>h — >/f/ not / p/ + /h/. Markers are one or more graphemes whose 
primary function is to indicate the correspondenc 
or to preserve a pattern, For example, the ^e in 

The major relational units include simple and compound consonants, 
single vowels, and vowel clusters. 

Venezky 1 s and Weir’s work showed that many 1' 
correspondences are very predictable, while other 



gs of relational units, 
cate marks a — =>/e/. 



Lfetter- to-sound 
:Js are not. F. is / f / 

in all English words except o£ , for example, and jc is /k / before a, o_ 



and u and / s / before £, i , and % (with certain exceptions, cello , 

social ) , but £0 may be either /u/, /u/ or /a/ before d_ as in food , 

* 

good and blood . 

Other examinations of letter-sound correspondences have been 
conducted by Oaks, Fry, Clymer, and Burmeister (Burmeister, 1968). 

The principal purposes of these studies were to test the usefulness 
of commonly taught phonics generalizations. These writers generally 
concluded that many of the phonics "rules" being taught were of little 
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All phonemic symbols are from the International P 
devised by the International Phonetic Association 
phonemic symbols of American English is given in Aj 
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value because of the numerous exceptions. These studies, though inter- 
preted by the authors to show the irregularities of English, point out 
in a limited way what the computers have divulged more thoroughly-- 
English orthography is not the highly irregular system many have 
thought it to be. 

Hanna approached the problem of symbol-sound relationships from 
the other direction, that is, from sound to symbol (Hanna, e_t al . , 1966) 
Since Hanna ej: ail, , were concerned with spelling rather than reading 
improvement, they tabulated the different spellings for a given sound, 
rather than the different sounds for a given spelling. They developed 
a 17,000 word corpus extracted from the Thorndike-Lorge, Teacher' s 
Word Book of 30,000 words and Merriam-Webs ter , New Collegiate Dictionary 
Their computer analysis of the corpus provided a complete analysis of 
sound to spelling correspondences in these English words. 

To test the utility of certain phonic generalizations, Burmeister 
attempted to identify the most common sounds of each vowel pair through 
an analysis of the af ormentioned Hanna study (Burmeister, 1968), 

However, her tables list only 26 of the many vowel clusters listed by 
Hanna; among the omitted clusters is io , which is the most common in 
English. Furthermore, some of her conclusions lack observable support. 
For example, her tables show ea — >/ i / 50.5% and ow — >f of 507o, yet 
she includes them in the category of vowel clusters which can be 
profitably taught with the "two vowels go a 'walking 111 rule. Some of 
her terminology is confusing. "Ordinarily when two vowels appear 
together they should be viewed as a grapheme . . (p. 445), probably 

refers to the fact that contiguous vowels usually represent one phoneme. 
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In general, her investigation seemed less than rigorous. 

While both linguists and educators have examined the relationship 
between symbol and sound in the English language, very little research 
has been conducted on the child 1 s acquisition of the symbol-sound code. 
Descriptions of two such studies follow. 

Biemiller and Levin conducted a study of the latency of oral 
response to words containing digraph spellings (sh, ai , ng ) and "common 
clusters" (s_l, cl). Their study was designed to examine the importance 
of auditory versus visual processing of stimulus words. They sampled 
48 children drawn from the second, third and fourth grades, and pre- 
sented them with words either preserving the intactness of the digraph 
(sh ed) or breaking the two letters (s bed) . 

Results indicated that second and third graders took nearly one 
second longer to read words whose initial and final digraph were broken 
than they did to read words whose digraphs were presented intact. The 
effect did not occur for fourth grade children. There were no latency 
effects attributable to breaking medial, vowel digraph (Levin, et al . , 
1968, p. 178). 

Of relevance to the current study is the conclusion that pro- 
nunciation of vowel clusters is not affected by division of the cluster. 
This suggests that children expect one phoneme rather than two when 
encountering vowel clusters. 

Another investigation of children* s acquisition of symbol-sound 
correspondences was undertaken by Calfee, Venezky and Chapman (1968) 
whose major concern, 

was to find the extent to which the reader used (regular) correspon- 
dences in pronouncing synthetic words, and how they pronounced synthetic 
words for which no such regular correspondence existed (p. IX). 
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They used 40 synthetic words, each typed in sans-sarif capital letters 
and mounted on a 35mm slide, which the subjects were to pronounce. A 
total of 245 students from the third, sixth, eleventh and twelfth 
grades, and college, participated in the study. Their general conclu- 
sion was that good readers consistently gave more appropriate responses 
to predictable letter-sound correspondence patterns than poor readers, 
though no group--even the oldest and the best readers--gave appropriate 
responses all the time. 

Some of the synthetic words contained vowel clusters which have, 
as noted earlier, unpredictable letter-sound correspondences. Calfee, 
jet al . , found , 

With the exception of ea and e£, overall agreement on a preferred 
pronunciation for a vowel digraph was not high; neither was there high 
agreement on specific items. Shifts in pronunciation of a given digraph 
as a result of context were observed, however, suggesting that choice 
of pronunciation may be contextually bound. Whatever the moderating 
mechanism, the spread of observed pronunciations for most digraph 
spellings suggests that it tends to be idiosyncratic (p . 167). 

No other research regarding children's pronunciations of vowel 
clusters has been undertaken, to the knowledge of this investigator. 

While developing reading ability, children’s generalizations 
result more often from example than from rule. Therefore, more infor- 
mation is needed about how children generalize from language input 
data, and the first step is to know what the input is. Since reading 
involves the ability to translate written symbols to sound, and since 
the letter-sound correspondences of vowel clusters are generally not 
predictable, awareness of the frequencies of the several pronunciations 
of each vowel cluster spelling in common English words is required. 
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Nearly one-third of the most common 20,00(3 words, and one-fifth 
of the 1,000 most frequent words in English contain vowel cluster 
spellings. Unless each of these is to be learned ;as a sight word, 
which is not practicable, a listing of all vowel clusters in English, 
and the phonemes and phonemic strings they represent, is needed. This 
information is an essential basis for an investigation of children's 
generalization behavior with vowel cluster spellings. 

What do young readers do when they encounter familiar vowel 
clusters in unfamiliar words? When they come upon an unfamiliar word 
with an ea cluster, do they attempt / i/ as in teach , or /e/ as in dea d , 
or /e/ as in great? Accurate and all-encompassing generalizations 
about compound vowel pronunciations cannot be taught, as they can with 
many other letter-sound correspondences--^ usually is / 1/ , £ is 
usually /k / before a 9 o, and u, etc.). Information is needed about 
how young readers pronounce unfamiliar words containing vowel clusters. 
Flexibility has long been a goal of reading instruction. Are children 
flexible readers? For example, when asked to pronounce unfamiliar 
words containing ow, will a child always give the /o/ pronunciation as 
in grow , or the /au/ sound as in now , or will he vary his pronunciations? 
Research is needed which will relate the pronunciation preferences of 
children to actual characteristics of the language. 

R ationale for the I nvestigation 

Vowel clusters are perhaps the most complex and unpredictable 
components of the letter-sound correspondence code. Vowel cluster 
spellings differ from single vowel spellings in several ways. They 
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rarely appear before geminate consonant clusters; some, such as aJL 
and au occur infrequently in word-final position. Others, ioa, ie, etc., 
rarely begin a word in English. 

Some vowel clusters have a major phonemic correspondent, and 
possibly several minor correspondents. The major correspondent of ^ai 
is /e/ as in bait , and it represents this sound in 85% of its occurrences. 
It represents /a/, villain ; /ai/, aisle , /e/> agai n; /&/ , plaid ; and 
others much less frequently. Other vowel clusters have two or more 
major correspondents, as well as minor correspondents. For example, 
ow is /o/ as in ow n 5 1% of the time and /au/,as in ow l 48%. Its only 
minor correspondent is /a/ as in knowledge . On the other hand, all 
single .vowel spellings have two major correspondents, (e.g., a — >/e/ 
or /es/) plus several minor correspondences. 

While single-vowel spellings can be traced to the earliest English 
writing, most vowel cluster spellings are much more recent, having 
been introduced during the late Middle English period. Consequently, 
vowel cluster correspondences underwent considerably fewer sound 
changes than did single-vowel spelling, though they did undergo some 
change (Mosse, 1952). For example, the Middle English diphthong /au/, 
spelled ajj or aw, developed in a complex manner (including French 
borrowings). With Che Great Vowel Shift, Middle English /au/ became 
/o/, though the au-aw spelling was retained. The vowel cluster oo 
first appeared in the 14th Century to represent / o : / ,. but did not 
become established until the 16th Century. /o:/ changed to /u:/ 
through the Great Vowel Shift, and to /u/ in Modern English. In some 
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oo spelled words, /u:/ shortened to /u/ as in book , and was later 
unrounded to /a/ as in flood (Venezky, 1963). 

The goals of the present study are to examine vowel c lus ter- sound 
relationships. Given that certain factors of pronunciation exist, 
(that is 3 in common English words many vowel clusters have six or more 
pronunciations), what factors influence a child's pronunciations of 
vowel-cluster words? Are good readers 1 pronunciations more often in 
the range of theoretical possibility than poor readers? What differ- 
ences arise as children progress through the elementary grades? Is 
pronunciation related to community environment or sex? Does consonant 
environment affect pronunciation choice? 

The present study seeks to answer questions about the relation- 
ships between the pronunciations of vowel clusters by a representative 
sample of elementary school children, and vowel clus ter--phoneme 
correspondences in a large corpus of common English words. Such 
information should provide a source for the modification of beginning 
reading materials and methods. For example, ari is /o/ as in caus e 
in 90% of common English words but is /&/ as in laugh in only two 
words and their derivations. Since laugh and aunt are frequently 
taught as "sight" words in early reading, it might be expected that 
children develop a false generalization about ari which they apply to 
unfamiliar words they encounter--even though letters and sounds are 
not stressed with "sight" words. The present study will reveal the 
extent to which this happens with the different types of children 
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Definition of Terms 

Several terms are important to an .understanding of this study. 

They are defined as follows: 

Vowel Cluster : a group of letters composed of two or more 

contiguous vowel graphemes. It is used synonomously with compound 
vowel, vowel pair, or vowel digraph. There are 61 different vowel 
clusters in the corpus of 20,000 common English words used by Venezky, 
1963; some occur in only one word and one occurs in more than 1,000 
words. ( aie occurs in one word--g aiety , while _io occurs in 1,293 words-- 
ac tion , r atio , lion , etc.) 

Letter - Sound Correspondenc e : a grapheme- phoneme relationship. 

Many letters, like d_ , f> 1 and z have invariant or nearly invariant 
pronunciations (_d — >/d/, _f — >/f/ except in o£_ where JE — $>/v/) and are, 
therefore, predictable. Some letters, like Jb , £, and m have variant 
letter-sound correspondences which can be predicted on the basis of 
grapheme environment (c. — >/ s/ before jb, jL, or y-- cell , city , cyst, 
otherwise £ — >/ k / ) . Each single letter vowel, (^-^-i-o-u-y) , has two 
major correspondences (long and short as in cape and cap) , which are 
usually predictable, and other less predictable pronunciations. Vowel 
clusters generally have several letter-sound correspondences (ou — >/au/, 
/u/ 5 and /u/ as in found , would and you ) which, in most instances, 
are not predictable. 

20,000 Wor d Corpus : a 1963 modification of the Thorndike list 

of 20,000 most frequent English words (Venezky, 1963). The original 
Thorndike list was revised by Venezky primarily through deletion of 
archaic words and addition of new words. The revised list was programmed 
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for a computer analysis of the letter-sound correspondence therein. 

The output included a complete tabulation of spelling- to-sound corres- 
pondences, along with word lists for each correspondence. In addition, 
a sound- to- spelling correspondence listing, a reversed spelling listing, 
and a reversed pronunciation listing was obtained. An analysis of the 
corpus by this investigator revealed 61 different vowel clusters, 
representing 92 different phonemes and phoneme strings for a total of 
more than 300 letter-sound correspondences in over 6,000 words. 

1,000 Word Corpus : the 1,003 most frequent English words derived 

from a corpus of 1,000,000 running words by Kucera and Francis (1967). 
This corpus is the most recent and certainly the most exhaustive 
:omputer tabulation of word frequencies to date. The 1,000 word 
corpus used in this study contains the 1,003 most frequently written 
American English words, ranging from the most frequent, the , which 
occurs 69,971 times per million running words, to the 1,003rd most 
frequent (11 words, each occurring 106 times per 1,000,000 running 
words-- appli ed , reach , etc.). 

Type : a "distinct word," viewed as one word regardless of how 

frequently the word appears. ( the and a pplied are considered distinct 
words regardless of their frequencies.) 

Token : an "individual word" considered for this study in terms 

of frequency of its appearance. In this study pronunciation frequencies 
of words in the 1,000 word corpus are based on tokens, while pronun- 
ciations in the 20,000 word corpus are based on types. The following 
example is offered for clarification. In the 1,000 word corpus there 
are five words which contain the au spelling. Of these five, four 
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have the /o/ pronunciation (because, etc.), while one has the /ae/ 
pronunciation ( laugh ) . With a token description-based on number of 
occurrences of each word in 1,000,000 running words, /q/ is equal to 
91.34% and /ae/ 8*57%. By comparison, if the pronunciation frequencies 
were based on types, /o / would equal 80% and /ae/ 20%. 

Reading Ability : performance on a standardized reading test. All 

subjects in this study received a grade-level equivalent reading score, 
on such standardized reading tests as the Metropolitan Achievement or 

tne Iowa Test of Basic Skills (see Appendices D, E, and F) . 

\ 

Intelligenc e: performance on a standardized group intelligence 

test (see Appendices D, E, and F) . 

\ 

Pis tractor : a multiple choice response item. 

\ 
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Chapter II 

SELECTION OF VOWEL CLUSTERS 



This study was designed to achieve two broad objectives: (1) to 

investigate elementary pupils 1 pronunciations of vowel clusters in 
unfamiliar words, and (2) to analyze the observed pronunciations in 
relation to existing letter-sound correspondences of vowel clusters in 
common English words. This chapter contains a description of the 
analysis of the vowel clusters in 20,000 common English words, and the 
procedures followed in the selection of representative vowel clusters 
used in this investigation. The symbol-sound correspondences of the 
selected vowel clusters as they occur in the 1,000 most frequent 
English words are presented also. 

Analysis of V owel C luste r Letter - Sound Correspondences 
As part of an inter-disciplinary study of the reading process begun 
at Cornell University in 1961, Venezky developed a computer program to 
derive and tabulate letter-sound correspondences in a corpus of 20,000 
common English words (Venezky, 1963). The 20,000 word corpus was a 
modification of the most common 20,000 words according to the Thorndike 
frequency count (Thorndike, 1941). Venezky omitted many archaic and 
low frequency words, particularly proper nouns, and added a number of 
words in their place. Along with other information, the computer 
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